SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Blank, Gregory S.
               Narindray, Daljit S.
               Zapata, Gerardo A.

(ii) TITLE OF INVENTION: Protein Recovery

(iii) NUMBER OF SEQUENCES: 7

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Genentech, Inc.
(B) STREET: 1 DNA Way
(C) CITY: South San Francisco
(D) STATE: California
(E) COUNTRY: USA
(F) ZIP: 94080

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: WinPatin (Genentech)

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US/09/940,166
(B) FILING DATE: 27-Aug-2001
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 09/097,309
(B) FILING DATE: 1998-06-12

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Schwartz, Timothy R.
(B) REGISTRATION NUMBER: 32171
(C) REFERENCE/DOCKET NUMBER: P1105R1

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 650/225-7467
(B) TELEFAX: 650/952-9881

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 241 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
1               5                   10                  15

Gly Ser Leu Arg Leu Ser Cys Ala Thr Ser Gly Tyr Thr Phe Thr
                20                  25                  30

Glu Tyr Thr Met His Trp Met Arg Gln Ala Pro Gly Lys Gly Leu
                35                  40                  45

Glu Trp Val Ala Gly Ile Asn Pro Lys Asn Gly Gly Thr Ser His
                50                  55                  60

RAW SEQUENCE LISTING
PATENT APPLICATION: US/09/940,166
DATE: 11/13/2001
TIME: 15:21:11
Input Set : N:\Crf3\RULE60\09940166.txt
Output Set: N:\CRF3\11132001\I940166.raw

Asn Gln Arg Phe Met Asp Arg Phe Thr Ile Ser Val Asp Lys Ser
                65                  70                  75

Thr Ser Thr Ala Tyr Met Gln Met Asn Ser Leu Arg Ala Glu Asp
                80                  85                  90

Thr Ala Val Tyr Tyr Cys Ala Arg Trp Arg Gly Leu Asn Tyr Gly
                95                  100                 105

Phe Asp Val Arg Tyr Phe Asp Val Trp Gly Gln Gly Thr Leu Val
                110                 115                 120

Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
                125                 130                 135

Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly
                140                 145                 150

Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
                155                 160                 165

Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val
                170                 175                 180

Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
                185                 190                 195

Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn
                200                 205                 210

His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys
                215                 220                 225

Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu
                230                 235                 240

Leu
241



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 214 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
1               5                   10                  15

Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Ile Asn
                20                  25                  30

Asn Tyr Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys
                35                  40                  45

Leu Leu Ile Tyr Tyr Thr Ser Thr Leu His Ser Gly Val Pro Ser
                50                  55                  60

Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr Ile
                65                  70                  75

Ser Ser Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln
                80                  85                  90

Gly Asn Thr Leu Pro Pro Thr Phe Gly Gln Gly Thr Lys Val Glu
                95                  100                 105

Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro
                110                 115                 120

Ser Asp Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu
                125                 130                 135

Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val
                140                 145                 150

Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu
                155                 160                 165

Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr
                170                 175                 180

Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu
                185                 190                 195

Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn
                200                 205                 210

Arg Gly Glu Cys
            214

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Leu Gly Gly Arg Met Lys Gln Leu Glu Asp Lys Val Glu Glu Leu
1               5                   10                  15

Leu Ser Lys Asn Tyr His Leu Glu Asn Glu Val Ala Arg Leu Lys
                20                  25                  30

Lys Leu Val Gly Glu Arg
                35  36

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

W--> 185 Leu Xaa Xaa Xaa Xaa Xaa Xaa
         1               5       7

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2143 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC 50
TCATTGCTGA GTTGTTATTT AAGCTTTGGA GATTATCGTC ACTGCAATGC 100
TTCGCAATAT GGCGCAAAAT GACCAACAGC GGTTGATTGA TCAGGTAGAG 150
GGGGCGCTGT ACGAGGTAAA GCCCGATGCC AGCATTCCTG ACGACGATAC 200
GGAGCTGCTG CGCGATTACG TAAAGAAGTT ATTGAAGCAT CCTCGTCAGT 250
AAAAAGTTAA TCTTTTCAAC AGCTGTCATA AAGTTGTCAC GGCCGAGACT 300
TATAGTCGCT TTGTTTTTAT TTTTTAATGT ATTTGTAACT AGAATTCGAG 350
CTCGCCGGGG ATCCTCTAGA GGTTGAGGTG ATTTTATGAA AAAGAATATC 400
GCATTTCTTC TTGCATCTAT GTTCGTTTTT TCTATTGCTA CAAACGCGTA 450
CGCTGATATC CAGATGACCC AGTCCCCGAG CTCCCTGTCC GCCTCTGTGG 500
GCGATAGGGT CACCATCACC TGTCGTGCCA GTCAGGACAT CAACAATTAT 550
? T GaIcTGGT ATCAACAGAA ACCAGGAAAA GCTCCGAAAC TACTGATTTA 600
^CCTCC ACCCTCCACT CTGGAGTCCC TTCTCGCTTC TCTGGTTCTG 650
g^ctgggac GGATTACACT CTGACCATCA GCAGTCTGCA ACCGGAGGAC 700
TTCGCAACTT ATTACTGTCA GCAAGGTAAT ACTCTGCCGC CGACGTTCGG 750
ACAGGGCACG AAGGTGGAGA TCAAACGAAC TGTGGCTGCA CCATCTGTCT 800
TCATCTTCCC GCCATCTGAT GAGCAGTTGA AATCTGGAAC TGCCTCTGTT 850
GTGTGCCTGC TGAATAACTT CTATCCCAGA GAGGCCAAAG TACAGTGGAA 900
GGTGGATAAC GCCCTCCAAT CGGGTAACTC CCAGGAGAGT GTCACAGAGC 950
AGGACAGCAA GGACAGCACC TACAGCCTCA GCAGCACCCT GACGCTGAGC 1000
AAAGCAGACT ACGAGAAACA CAAAGTCTAC GCCTGCGAAG TCACCCATCA 1050
GGGCCTGAGC TCGCCCGTCA CAAAGAGCTT CAACAGGGGA GAGTGTTAAG 1100
CTGATCCTCT ACGCCGGACG CATCGTGGCG CTAGTACGCA AGTTCACGTA 1150
Saacggtat CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 1200
TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TGCTACAAAC GCGTACGCTG 1250
AGGTTCAGCT GGTGGAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 1300
CTCCGTTTGT CCTGTGCAAC TTCTGGCTAC ACCTTTACCG AATACACTAT 1350
GCACTGGATG CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGCAGGGA 1400
TTaScCTaI AAACGGTGGT ACCAGCCACA ACCAGAGGTT CATGGACCGT 1450
^GtSS GCGTAGATAA ATCCACCAGT ACAGCCTACA TGCAAATGAA 1500
CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTATTGTGCT AGATGGCGAG 1550
GCCTGAACTA CGGCTTTGAC GTCCGTTATT TTGACGTCTG GGGTCAAGGA 1600
ACCCTGGTCA CCGTCTCCTC GGCCTCCACC AAGGGCCCAT CGGTCTTCCC 1650
CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG GCCCTGGGCT 1700
GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 1750
GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC 1800
AGGACTCTAC TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG 1850
GCACCCAGAC CTACATCTGC AACGTGAATC ACAAGCCCAG CAACACCAAG 1900
GTCGACAAGA AAGTTGAGCC CAAATCTTGT GACAAAACTC ACACATGCCC 1950
GCCGTGCCCA GCACCAGAAC TGCTGGGCGG CCGCATGAAA CAGCTAGAGG 2000
ACAAGGTCGA AGAGCTACTC TCCAAGAACT ACCACCTAGA GAATGAAGTG 2050
GCAAGACTCA AAAAGCTTGT CGGGGAGCGC TAAGCATGCG ACCGCCCTAG 2100



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 237 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
-23         -20                 -15                 -10

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Gln Met Thr Gln Ser
            -5                  1               5

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr Ile Thr
        10                  15                  20

Cys Arg Ala Ser Gln Asp Ile Asn Asn Tyr Leu Asn Trp Tyr Gln
        25                  30                  35

Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Tyr Thr Ser
        40                  45                  50

Thr Leu His Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser
        55                  60                  65

Gly Thr Asp Tyr Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp
        70                  75                  80

Phe Ala Thr Tyr Tyr Cys Gln Gln Gly Asn Thr Leu Pro Pro Thr
        85                  90                  95

Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala
        100                 105                 110

Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
        115                 120                 125

Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
        130                 135                 140

Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly
        145                 150                 155

Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr
        160                 165                 170

Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu
        175                 180                 185

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser
        190                 195                 200

Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
        205                 210             214

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 300 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
-23         -20                 -15                 -10

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Glu Ser
            -5                  1               5

Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
        10                  15                  20

Ala Thr Ser Gly Tyr Thr Phe Thr Glu Tyr Thr Met His Trp Met
        25                  30                  35

Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Gly Ile Asn
        40                  45                  50

Pro Lys Asn Gly Gly Thr Ser His Asn Gln Arg Phe Met Asp Arg
        55                  60                  65

Phe Thr Ile Ser Val Asp Lys Ser Thr Ser Thr Ala Tyr Met Gln
        70                  75                  80

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
        85                  90                  95

Arg Trp Arg Gly Leu Asn Tyr Gly Phe Asp Val Arg Tyr Phe Asp
        100                 105                 110

Val Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr
        115                 120                 125

L:28 M:220 C: Keyword misspelled or invalid format [(A)
L:29 M:220 C: Keyword misspelled or invalid format [(B)
L:185 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4